Search Space Pruning Based on Anticipated Path Recombination in LVCSR
نویسندگان
چکیده
In this paper we introduce a well-motivated abstract pruning criterion for LVCSR decoders based on the anticipated recombination of HMM state alignment paths. We show that several heuristical pruning methods common in dynamic network decoders are approximations of this pruning criterion. The abstract criterion is too complex to be applied directly in an efficient manner, so we derive approximations which can be applied efficiently. Our new pruning methods allow much more exhaustive pruning of the search space than previous methods. We show that the size of the search space can be reduced by up to 50% at equal precision over the previous state of the art, and the RTF by 20%. The abstract pruning criterion can be considered a guide to derive effective pruning methods for any kind of time synchronous decoder.
منابع مشابه
Word pair approximation for more efficient decoding with high-order language models
The search effort in LVCSR depends on the order of the language model (LM); search hypotheses are only recombined once the LM allows for it. In this work we show how the LM dependence can be partially eliminated by exploiting the well-known word pair approximation. We enforce preemptive unigramor bigram-like LM recombination at word boundaries. We capture the recombination in a lattice, and lat...
متن کاملEfficient evaluation of the LVCSR search space using the NOWAY decoder
This work further develops and analyses the large vocabulary continuous speech recognition (LVCSR) search strategy reported at ICASSP-95 1]. In particular, the posterior-based phone deactivation pruning approach has been extended to include phone-dependent thresholds and an improved estimate of the least upper bound on the utterance log-probability has been developed. Analysis of the pruning pr...
متن کاملDecoder Technology for Connectionist Large Vocabulary Speech Recognition
The search problem in large vocabulary continuous speech recognition (LVCSR) is to locate the most probable string of words for a spoken utterance given the acoustic signal and a set of sentence models. Searching the space of possible utterances is difficult because of the large vocabulary size and the complexity imposed when long-span language models are used. This report describes an efficien...
متن کاملNovel LVCSR Decoder Based on Perfect Hash Automata and Tuple Structures – SPREAD –
The paper presents the novel design of a one-pass large vocabulary continuous-speech recognition decoder engine, named SPREAD. The decoder is based on a time-synchronous beam-search approach, including statically expanded cross-word triphone contexts. An approach using efficient tuple structures is proposed for the construction of the complete search-network. The foremost benefits are the impor...
متن کاملPruning Heuristics for Path Sensitization in Test Pattern Generation
A powerful combinational path sensitization engine is required for the efficient implementation of tools for test pattern generation, timing analysis, and delay-fault testing. Path sensitization can be posed as a search, in the ndimensional Boolean space, for a consistent assignment of logic values to the circuit nodes which also satisfies a given condition. While the conditions for path sensit...
متن کامل